Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 117.3 KiB |
| Average record size in memory | 120.1 B |
Variable types
| Numeric | 6 |
|---|---|
| Categorical | 9 |
race is highly imbalanced (65.3%) | Imbalance |
native-country is highly imbalanced (82.1%) | Imbalance |
capital-gain has 919 (91.9%) zeros | Zeros |
capital-loss has 950 (95.0%) zeros | Zeros |
Reproduction
| Analysis started | 2024-03-03 09:42:44.214478 |
|---|---|
| Analysis finished | 2024-03-03 09:42:58.023847 |
| Duration | 13.81 seconds |
| Software version | ydata-profiling vv4.6.5 |
| Download configuration | config.json |
age
Real number (ℝ)
| Distinct | 66 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.051 |
| Minimum | 17 |
|---|---|
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 17 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 28 |
| median | 36 |
| Q3 | 46 |
| 95-th percentile | 63 |
| Maximum | 90 |
| Range | 73 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 13.34948 |
|---|---|
| Coefficient of variation (CV) | 0.35083124 |
| Kurtosis | -0.042832687 |
| Mean | 38.051 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.58910654 |
| Sum | 38051 |
| Variance | 178.20861 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31 | 33 | 3.3% |
| 43 | 33 | 3.3% |
| 34 | 32 | 3.2% |
| 23 | 30 | 3.0% |
| 33 | 30 | 3.0% |
| 44 | 28 | 2.8% |
| 35 | 28 | 2.8% |
| 24 | 28 | 2.8% |
| 36 | 28 | 2.8% |
| 42 | 28 | 2.8% |
| Other values (56) | 702 |
| Value | Count | Frequency (%) |
| 17 | 20 | |
| 18 | 16 | |
| 19 | 21 | |
| 20 | 23 | |
| 21 | 17 | |
| 22 | 22 | |
| 23 | 30 | |
| 24 | 28 | |
| 25 | 23 | |
| 26 | 21 |
| Value | Count | Frequency (%) |
| 90 | 1 | |
| 81 | 1 | |
| 80 | 1 | |
| 79 | 1 | |
| 78 | 1 | |
| 77 | 1 | |
| 76 | 2 | |
| 75 | 1 | |
| 74 | 1 | |
| 73 | 1 |
workclass
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Private | |
|---|---|
| Self-emp-not-inc | |
| Local-gov | 68 |
| ? | 62 |
| State-gov | 37 |
| Other values (2) | 54 |
Length
| Max length | 17 |
|---|---|
| Median length | 8 |
| Mean length | 8.816 |
| Min length | 2 |
Characters and Unicode
| Total characters | 8816 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | State-gov |
|---|---|
| 2nd row | Self-emp-not-inc |
| 3rd row | Private |
| 4th row | Private |
| 5th row | Private |
Common Values
| Value | Count | Frequency (%) |
| Private | 698 | |
| Self-emp-not-inc | 81 | 8.1% |
| Local-gov | 68 | 6.8% |
| ? | 62 | 6.2% |
| State-gov | 37 | 3.7% |
| Self-emp-inc | 33 | 3.3% |
| Federal-gov | 21 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| private | 698 | |
| self-emp-not-inc | 81 | 8.1% |
| local-gov | 68 | 6.8% |
| 62 | 6.2% | |
| state-gov | 37 | 3.7% |
| self-emp-inc | 33 | 3.3% |
| federal-gov | 21 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1005 | |
| 1000 | ||
| t | 853 | |
| a | 824 | |
| v | 824 | |
| i | 812 | |
| r | 719 | |
| P | 698 | |
| - | 435 | 4.9% |
| o | 275 | 3.1% |
| Other values (12) | 1371 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6381 | |
| Space Separator | 1000 | 11.3% |
| Uppercase Letter | 938 | 10.6% |
| Dash Punctuation | 435 | 4.9% |
| Other Punctuation | 62 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1005 | |
| t | 853 | |
| a | 824 | |
| v | 824 | |
| i | 812 | |
| r | 719 | |
| o | 275 | 4.3% |
| l | 203 | 3.2% |
| n | 195 | 3.1% |
| c | 182 | 2.9% |
| Other values (5) | 489 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 698 | |
| S | 151 | 16.1% |
| L | 68 | 7.2% |
| F | 21 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1000 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 435 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 62 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7319 | |
| Common | 1497 | 17.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1005 | |
| t | 853 | |
| a | 824 | |
| v | 824 | |
| i | 812 | |
| r | 719 | |
| P | 698 | |
| o | 275 | 3.8% |
| l | 203 | 2.8% |
| n | 195 | 2.7% |
| Other values (9) | 911 |
Common
| Value | Count | Frequency (%) |
| 1000 | ||
| - | 435 | |
| ? | 62 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8816 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1005 | |
| 1000 | ||
| t | 853 | |
| a | 824 | |
| v | 824 | |
| i | 812 | |
| r | 719 | |
| P | 698 | |
| - | 435 | 4.9% |
| o | 275 | 3.1% |
| Other values (12) | 1371 |
fnlwgt
Real number (ℝ)
| Distinct | 987 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 191904.98 |
| Minimum | 21174 |
|---|---|
| Maximum | 1033222 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 21174 |
|---|---|
| 5-th percentile | 43274.15 |
| Q1 | 115041.25 |
| median | 180590.5 |
| Q3 | 247152.25 |
| 95-th percentile | 382841.9 |
| Maximum | 1033222 |
| Range | 1012048 |
| Interquartile range (IQR) | 132111 |
Descriptive statistics
| Standard deviation | 108125.54 |
|---|---|
| Coefficient of variation (CV) | 0.56343272 |
| Kurtosis | 5.7469451 |
| Mean | 191904.98 |
| Median Absolute Deviation (MAD) | 65749 |
| Skewness | 1.483933 |
| Sum | 1.9190498 × 108 |
| Variance | 1.1691133 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 116632 | 3 | 0.3% |
| 108293 | 2 | 0.2% |
| 32185 | 2 | 0.2% |
| 217460 | 2 | 0.2% |
| 111567 | 2 | 0.2% |
| 368700 | 2 | 0.2% |
| 182556 | 2 | 0.2% |
| 191277 | 2 | 0.2% |
| 92262 | 2 | 0.2% |
| 194636 | 2 | 0.2% |
| Other values (977) | 979 |
| Value | Count | Frequency (%) |
| 21174 | 1 | |
| 21906 | 1 | |
| 22463 | 1 | |
| 23780 | 1 | |
| 24215 | 1 | |
| 25429 | 1 | |
| 25826 | 1 | |
| 25828 | 1 | |
| 27053 | 1 | |
| 27337 | 1 |
| Value | Count | Frequency (%) |
| 1033222 | 1 | |
| 860348 | 1 | |
| 680390 | 1 | |
| 635913 | 1 | |
| 633742 | 1 | |
| 556660 | 1 | |
| 544091 | 1 | |
| 543162 | 1 | |
| 543028 | 1 | |
| 538583 | 1 |
education
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| HS-grad | |
|---|---|
| Some-college | |
| Bachelors | |
| Masters | |
| Assoc-voc | |
| Other values (11) |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 9.438 |
| Min length | 4 |
Characters and Unicode
| Total characters | 9438 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Bachelors |
|---|---|
| 2nd row | Bachelors |
| 3rd row | HS-grad |
| 4th row | 11th |
| 5th row | Bachelors |
Common Values
| Value | Count | Frequency (%) |
| HS-grad | 321 | |
| Some-college | 225 | |
| Bachelors | 166 | |
| Masters | 54 | 5.4% |
| Assoc-voc | 48 | 4.8% |
| 11th | 46 | 4.6% |
| Assoc-acdm | 35 | 3.5% |
| 10th | 21 | 2.1% |
| 9th | 16 | 1.6% |
| 7th-8th | 15 | 1.5% |
| Other values (6) | 53 | 5.3% |
Length
| Value | Count | Frequency (%) |
| hs-grad | 321 | |
| some-college | 225 | |
| bachelors | 166 | |
| masters | 54 | 5.4% |
| assoc-voc | 48 | 4.8% |
| 11th | 46 | 4.6% |
| assoc-acdm | 35 | 3.5% |
| 10th | 21 | 2.1% |
| 9th | 16 | 1.6% |
| 7th-8th | 15 | 1.5% |
| Other values (6) | 53 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1000 | 10.6% | |
| e | 911 | 9.7% |
| o | 809 | 8.6% |
| - | 672 | 7.1% |
| l | 628 | 6.7% |
| a | 590 | 6.3% |
| c | 583 | 6.2% |
| r | 567 | 6.0% |
| S | 546 | 5.8% |
| g | 546 | 5.8% |
| Other values (22) | 2586 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6336 | |
| Uppercase Letter | 1196 | 12.7% |
| Space Separator | 1000 | 10.6% |
| Dash Punctuation | 672 | 7.1% |
| Decimal Number | 234 | 2.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 911 | |
| o | 809 | |
| l | 628 | |
| a | 590 | |
| c | 583 | |
| r | 567 | |
| g | 546 | |
| s | 459 | |
| d | 356 | 5.6% |
| h | 329 | 5.2% |
| Other values (4) | 558 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 129 | |
| 0 | 21 | 9.0% |
| 9 | 16 | 6.8% |
| 7 | 15 | 6.4% |
| 8 | 15 | 6.4% |
| 5 | 11 | 4.7% |
| 6 | 11 | 4.7% |
| 2 | 9 | 3.8% |
| 4 | 7 | 3.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 546 | |
| H | 321 | |
| B | 166 | 13.9% |
| A | 83 | 6.9% |
| M | 54 | 4.5% |
| D | 14 | 1.2% |
| P | 12 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 1000 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7532 | |
| Common | 1906 | 20.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 911 | |
| o | 809 | |
| l | 628 | |
| a | 590 | 7.8% |
| c | 583 | 7.7% |
| r | 567 | 7.5% |
| S | 546 | 7.2% |
| g | 546 | 7.2% |
| s | 459 | 6.1% |
| d | 356 | 4.7% |
| Other values (11) | 1537 |
Common
| Value | Count | Frequency (%) |
| 1000 | ||
| - | 672 | |
| 1 | 129 | 6.8% |
| 0 | 21 | 1.1% |
| 9 | 16 | 0.8% |
| 7 | 15 | 0.8% |
| 8 | 15 | 0.8% |
| 5 | 11 | 0.6% |
| 6 | 11 | 0.6% |
| 2 | 9 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9438 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1000 | 10.6% | |
| e | 911 | 9.7% |
| o | 809 | 8.6% |
| - | 672 | 7.1% |
| l | 628 | 6.7% |
| a | 590 | 6.3% |
| c | 583 | 6.2% |
| r | 567 | 6.0% |
| S | 546 | 5.8% |
| g | 546 | 5.8% |
| Other values (22) | 2586 |
education-num
Real number (ℝ)
| Distinct | 16 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.084 |
| Minimum | 1 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 9 |
| median | 10 |
| Q3 | 12 |
| 95-th percentile | 14 |
| Maximum | 16 |
| Range | 15 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.5486152 |
|---|---|
| Coefficient of variation (CV) | 0.25273852 |
| Kurtosis | 0.84022152 |
| Mean | 10.084 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.37139661 |
| Sum | 10084 |
| Variance | 6.4954394 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 321 | |
| 10 | 225 | |
| 13 | 166 | |
| 14 | 54 | 5.4% |
| 11 | 48 | 4.8% |
| 7 | 46 | 4.6% |
| 12 | 35 | 3.5% |
| 6 | 21 | 2.1% |
| 5 | 16 | 1.6% |
| 4 | 15 | 1.5% |
| Other values (6) | 53 | 5.3% |
| Value | Count | Frequency (%) |
| 1 | 2 | 0.2% |
| 2 | 7 | 0.7% |
| 3 | 11 | 1.1% |
| 4 | 15 | 1.5% |
| 5 | 16 | 1.6% |
| 6 | 21 | 2.1% |
| 7 | 46 | 4.6% |
| 8 | 9 | 0.9% |
| 9 | 321 | |
| 10 | 225 |
| Value | Count | Frequency (%) |
| 16 | 14 | 1.4% |
| 15 | 10 | 1.0% |
| 14 | 54 | 5.4% |
| 13 | 166 | |
| 12 | 35 | 3.5% |
| 11 | 48 | 4.8% |
| 10 | 225 | |
| 9 | 321 | |
| 8 | 9 | 0.9% |
| 7 | 46 | 4.6% |
marital-status
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Married-civ-spouse | |
|---|---|
| Never-married | |
| Divorced | |
| Widowed | 33 |
| Separated | 28 |
| Other values (2) | 16 |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 15.349 |
| Min length | 8 |
Characters and Unicode
| Total characters | 15349 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Never-married |
|---|---|
| 2nd row | Married-civ-spouse |
| 3rd row | Divorced |
| 4th row | Married-civ-spouse |
| 5th row | Married-civ-spouse |
Common Values
| Value | Count | Frequency (%) |
| Married-civ-spouse | 443 | |
| Never-married | 344 | |
| Divorced | 136 | 13.6% |
| Widowed | 33 | 3.3% |
| Separated | 28 | 2.8% |
| Married-spouse-absent | 15 | 1.5% |
| Married-AF-spouse | 1 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| married-civ-spouse | 443 | |
| never-married | 344 | |
| divorced | 136 | 13.6% |
| widowed | 33 | 3.3% |
| separated | 28 | 2.8% |
| married-spouse-absent | 15 | 1.5% |
| married-af-spouse | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2190 | |
| r | 2114 | |
| i | 1415 | |
| - | 1262 | |
| d | 1033 | 6.7% |
| 1000 | 6.5% | |
| s | 933 | 6.1% |
| v | 923 | 6.0% |
| a | 874 | 5.7% |
| o | 628 | 4.1% |
| Other values (15) | 2977 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12085 | |
| Dash Punctuation | 1262 | 8.2% |
| Uppercase Letter | 1002 | 6.5% |
| Space Separator | 1000 | 6.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2190 | |
| r | 2114 | |
| i | 1415 | |
| d | 1033 | |
| s | 933 | |
| v | 923 | |
| a | 874 | 7.2% |
| o | 628 | 5.2% |
| c | 579 | 4.8% |
| p | 487 | 4.0% |
| Other values (6) | 909 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 459 | |
| N | 344 | |
| D | 136 | 13.6% |
| W | 33 | 3.3% |
| S | 28 | 2.8% |
| A | 1 | 0.1% |
| F | 1 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1262 |
Space Separator
| Value | Count | Frequency (%) |
| 1000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13087 | |
| Common | 2262 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2190 | |
| r | 2114 | |
| i | 1415 | |
| d | 1033 | |
| s | 933 | |
| v | 923 | |
| a | 874 | 6.7% |
| o | 628 | 4.8% |
| c | 579 | 4.4% |
| p | 487 | 3.7% |
| Other values (13) | 1911 |
Common
| Value | Count | Frequency (%) |
| - | 1262 | |
| 1000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15349 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2190 | |
| r | 2114 | |
| i | 1415 | |
| - | 1262 | |
| d | 1033 | 6.7% |
| 1000 | 6.5% | |
| s | 933 | 6.1% |
| v | 923 | 6.0% |
| a | 874 | 5.7% |
| o | 628 | 4.1% |
| Other values (15) | 2977 |
occupation
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Craft-repair | |
|---|---|
| Prof-specialty | |
| Exec-managerial | |
| Sales | |
| Other-service | |
| Other values (10) |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 13.139 |
| Min length | 2 |
Characters and Unicode
| Total characters | 13139 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Adm-clerical |
|---|---|
| 2nd row | Exec-managerial |
| 3rd row | Handlers-cleaners |
| 4th row | Handlers-cleaners |
| 5th row | Prof-specialty |
Common Values
| Value | Count | Frequency (%) |
| Craft-repair | 126 | |
| Prof-specialty | 124 | |
| Exec-managerial | 124 | |
| Sales | 112 | |
| Other-service | 107 | |
| Adm-clerical | 94 | |
| ? | 62 | |
| Machine-op-inspct | 61 | |
| Transport-moving | 52 | |
| Tech-support | 44 | 4.4% |
| Other values (5) | 94 |
Length
| Value | Count | Frequency (%) |
| craft-repair | 126 | |
| prof-specialty | 124 | |
| exec-managerial | 124 | |
| sales | 112 | |
| other-service | 107 | |
| adm-clerical | 94 | |
| 62 | ||
| machine-op-inspct | 61 | |
| transport-moving | 52 | |
| tech-support | 44 | 4.4% |
| Other values (5) | 94 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1315 | 10.0% |
| r | 1239 | 9.4% |
| a | 1184 | 9.0% |
| 1000 | 7.6% | |
| - | 890 | 6.8% |
| i | 861 | 6.6% |
| c | 769 | 5.9% |
| s | 640 | 4.9% |
| l | 634 | 4.8% |
| t | 546 | 4.2% |
| Other values (23) | 4061 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10248 | |
| Space Separator | 1000 | 7.6% |
| Uppercase Letter | 939 | 7.1% |
| Dash Punctuation | 890 | 6.8% |
| Other Punctuation | 62 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1315 | |
| r | 1239 | |
| a | 1184 | |
| i | 861 | |
| c | 769 | 7.5% |
| s | 640 | 6.2% |
| l | 634 | 6.2% |
| t | 546 | 5.3% |
| p | 512 | 5.0% |
| n | 498 | 4.9% |
| Other values (10) | 2050 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 143 | |
| C | 126 | |
| E | 124 | |
| S | 112 | |
| O | 107 | |
| T | 96 | |
| A | 95 | |
| M | 61 | |
| H | 43 | 4.6% |
| F | 32 | 3.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1000 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 890 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 62 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11187 | |
| Common | 1952 | 14.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1315 | |
| r | 1239 | |
| a | 1184 | 10.6% |
| i | 861 | 7.7% |
| c | 769 | 6.9% |
| s | 640 | 5.7% |
| l | 634 | 5.7% |
| t | 546 | 4.9% |
| p | 512 | 4.6% |
| n | 498 | 4.5% |
| Other values (20) | 2989 |
Common
| Value | Count | Frequency (%) |
| 1000 | ||
| - | 890 | |
| ? | 62 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13139 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1315 | 10.0% |
| r | 1239 | 9.4% |
| a | 1184 | 9.0% |
| 1000 | 7.6% | |
| - | 890 | 6.8% |
| i | 861 | 6.6% |
| c | 769 | 5.9% |
| s | 640 | 4.9% |
| l | 634 | 4.8% |
| t | 546 | 4.2% |
| Other values (23) | 4061 |
relationship
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Husband | |
|---|---|
| Not-in-family | |
| Own-child | |
| Unmarried | |
| Wife |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 10.179 |
| Min length | 5 |
Characters and Unicode
| Total characters | 10179 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not-in-family |
|---|---|
| 2nd row | Husband |
| 3rd row | Not-in-family |
| 4th row | Husband |
| 5th row | Wife |
Common Values
| Value | Count | Frequency (%) |
| Husband | 376 | |
| Not-in-family | 279 | |
| Own-child | 151 | |
| Unmarried | 109 | 10.9% |
| Wife | 61 | 6.1% |
| Other-relative | 24 | 2.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| husband | 376 | |
| not-in-family | 279 | |
| own-child | 151 | |
| unmarried | 109 | 10.9% |
| wife | 61 | 6.1% |
| other-relative | 24 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1000 | 9.8% | |
| n | 915 | 9.0% |
| i | 903 | 8.9% |
| a | 788 | 7.7% |
| - | 733 | 7.2% |
| d | 636 | 6.2% |
| l | 454 | 4.5% |
| m | 388 | 3.8% |
| H | 376 | 3.7% |
| u | 376 | 3.7% |
| Other values (16) | 3610 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7446 | |
| Space Separator | 1000 | 9.8% |
| Uppercase Letter | 1000 | 9.8% |
| Dash Punctuation | 733 | 7.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 915 | |
| i | 903 | |
| a | 788 | |
| d | 636 | 8.5% |
| l | 454 | 6.1% |
| m | 388 | 5.2% |
| u | 376 | 5.0% |
| s | 376 | 5.0% |
| b | 376 | 5.0% |
| f | 340 | 4.6% |
| Other values (9) | 1894 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 376 | |
| N | 279 | |
| O | 175 | |
| U | 109 | 10.9% |
| W | 61 | 6.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1000 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 733 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8446 | |
| Common | 1733 | 17.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 915 | 10.8% |
| i | 903 | 10.7% |
| a | 788 | 9.3% |
| d | 636 | 7.5% |
| l | 454 | 5.4% |
| m | 388 | 4.6% |
| H | 376 | 4.5% |
| u | 376 | 4.5% |
| s | 376 | 4.5% |
| b | 376 | 4.5% |
| Other values (14) | 2858 |
Common
| Value | Count | Frequency (%) |
| 1000 | ||
| - | 733 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10179 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1000 | 9.8% | |
| n | 915 | 9.0% |
| i | 903 | 8.9% |
| a | 788 | 7.7% |
| - | 733 | 7.2% |
| d | 636 | 6.2% |
| l | 454 | 4.5% |
| m | 388 | 3.8% |
| H | 376 | 3.7% |
| u | 376 | 3.7% |
| Other values (16) | 3610 |
race
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| White | |
|---|---|
| Black | |
| Asian-Pac-Islander | 27 |
| Amer-Indian-Eskimo | 10 |
| Other | 6 |
Length
| Max length | 19 |
|---|---|
| Median length | 6 |
| Mean length | 6.481 |
| Min length | 6 |
Characters and Unicode
| Total characters | 6481 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | White |
| 3rd row | White |
| 4th row | Black |
| 5th row | Black |
Common Values
| Value | Count | Frequency (%) |
| White | 847 | |
| Black | 110 | 11.0% |
| Asian-Pac-Islander | 27 | 2.7% |
| Amer-Indian-Eskimo | 10 | 1.0% |
| Other | 6 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 847 | |
| black | 110 | 11.0% |
| asian-pac-islander | 27 | 2.7% |
| amer-indian-eskimo | 10 | 1.0% |
| other | 6 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1000 | ||
| i | 894 | |
| e | 890 | |
| t | 853 | |
| h | 853 | |
| W | 847 | |
| a | 201 | 3.1% |
| c | 137 | 2.1% |
| l | 137 | 2.1% |
| k | 120 | 1.9% |
| Other values (13) | 549 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4333 | |
| Uppercase Letter | 1074 | 16.6% |
| Space Separator | 1000 | 15.4% |
| Dash Punctuation | 74 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 894 | |
| e | 890 | |
| t | 853 | |
| h | 853 | |
| a | 201 | 4.6% |
| c | 137 | 3.2% |
| l | 137 | 3.2% |
| k | 120 | 2.8% |
| n | 74 | 1.7% |
| s | 64 | 1.5% |
| Other values (4) | 110 | 2.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 847 | |
| B | 110 | 10.2% |
| A | 37 | 3.4% |
| I | 37 | 3.4% |
| P | 27 | 2.5% |
| E | 10 | 0.9% |
| O | 6 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 1000 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 74 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5407 | |
| Common | 1074 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 894 | |
| e | 890 | |
| t | 853 | |
| h | 853 | |
| W | 847 | |
| a | 201 | 3.7% |
| c | 137 | 2.5% |
| l | 137 | 2.5% |
| k | 120 | 2.2% |
| B | 110 | 2.0% |
| Other values (11) | 365 |
Common
| Value | Count | Frequency (%) |
| 1000 | ||
| - | 74 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6481 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1000 | ||
| i | 894 | |
| e | 890 | |
| t | 853 | |
| h | 853 | |
| W | 847 | |
| a | 201 | 3.1% |
| c | 137 | 2.1% |
| l | 137 | 2.1% |
| k | 120 | 1.9% |
| Other values (13) | 549 |
sex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Male | |
|---|---|
| Female |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.658 |
| Min length | 5 |
Characters and Unicode
| Total characters | 5658 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Male |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Male | 671 | |
| Female | 329 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 671 | |
| female | 329 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1329 | |
| a | 1000 | |
| 1000 | ||
| l | 1000 | |
| M | 671 | |
| F | 329 | 5.8% |
| m | 329 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3658 | |
| Space Separator | 1000 | 17.7% |
| Uppercase Letter | 1000 | 17.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1329 | |
| a | 1000 | |
| l | 1000 | |
| m | 329 | 9.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 671 | |
| F | 329 |
Space Separator
| Value | Count | Frequency (%) |
| 1000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4658 | |
| Common | 1000 | 17.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1329 | |
| a | 1000 | |
| l | 1000 | |
| M | 671 | |
| F | 329 | 7.1% |
| m | 329 | 7.1% |
Common
| Value | Count | Frequency (%) |
| 1000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5658 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1329 | |
| a | 1000 | |
| 1000 | ||
| l | 1000 | |
| M | 671 | |
| F | 329 | 5.8% |
| m | 329 | 5.8% |
capital-gain
Real number (ℝ)
ZEROS 
| Distinct | 36 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 588.526 |
| Minimum | 0 |
|---|---|
| Maximum | 34095 |
| Zeros | 919 |
| Zeros (%) | 91.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 4115.25 |
| Maximum | 34095 |
| Range | 34095 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2618.5375 |
|---|---|
| Coefficient of variation (CV) | 4.4493149 |
| Kurtosis | 49.452783 |
| Mean | 588.526 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.2480627 |
| Sum | 588526 |
| Variance | 6856738.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 919 | |
| 15024 | 11 | 1.1% |
| 7688 | 8 | 0.8% |
| 7298 | 8 | 0.8% |
| 4386 | 5 | 0.5% |
| 2174 | 4 | 0.4% |
| 5178 | 4 | 0.4% |
| 14084 | 3 | 0.3% |
| 1055 | 3 | 0.3% |
| 594 | 3 | 0.3% |
| Other values (26) | 32 | 3.2% |
| Value | Count | Frequency (%) |
| 0 | 919 | |
| 594 | 3 | 0.3% |
| 1055 | 3 | 0.3% |
| 1111 | 1 | 0.1% |
| 1409 | 1 | 0.1% |
| 2050 | 1 | 0.1% |
| 2174 | 4 | 0.4% |
| 2176 | 1 | 0.1% |
| 2407 | 3 | 0.3% |
| 2463 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 34095 | 1 | 0.1% |
| 25236 | 1 | 0.1% |
| 20051 | 1 | 0.1% |
| 15024 | 11 | |
| 14344 | 1 | 0.1% |
| 14084 | 3 | 0.3% |
| 10605 | 1 | 0.1% |
| 9386 | 1 | 0.1% |
| 8614 | 1 | 0.1% |
| 7688 | 8 |
capital-loss
Real number (ℝ)
ZEROS 
| Distinct | 30 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 92.96 |
| Minimum | 0 |
|---|---|
| Maximum | 2415 |
| Zeros | 950 |
| Zeros (%) | 95.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 32.65 |
| Maximum | 2415 |
| Range | 2415 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 412.44234 |
|---|---|
| Coefficient of variation (CV) | 4.4367721 |
| Kurtosis | 17.451486 |
| Mean | 92.96 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.3441325 |
| Sum | 92960 |
| Variance | 170108.68 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 950 | |
| 1977 | 6 | 0.6% |
| 1902 | 5 | 0.5% |
| 1887 | 4 | 0.4% |
| 2415 | 3 | 0.3% |
| 1762 | 2 | 0.2% |
| 1980 | 2 | 0.2% |
| 1564 | 2 | 0.2% |
| 2179 | 2 | 0.2% |
| 1408 | 2 | 0.2% |
| Other values (20) | 22 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 950 | |
| 653 | 1 | 0.1% |
| 1340 | 1 | 0.1% |
| 1380 | 2 | 0.2% |
| 1408 | 2 | 0.2% |
| 1485 | 1 | 0.1% |
| 1504 | 1 | 0.1% |
| 1564 | 2 | 0.2% |
| 1573 | 1 | 0.1% |
| 1669 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 2415 | 3 | |
| 2392 | 1 | 0.1% |
| 2377 | 1 | 0.1% |
| 2352 | 1 | 0.1% |
| 2339 | 1 | 0.1% |
| 2206 | 1 | 0.1% |
| 2179 | 2 | |
| 2051 | 1 | 0.1% |
| 2042 | 1 | 0.1% |
| 1980 | 2 |
hours-per-week
Real number (ℝ)
| Distinct | 56 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.876 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 40 |
| median | 40 |
| Q3 | 45 |
| 95-th percentile | 60 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 12.018114 |
|---|---|
| Coefficient of variation (CV) | 0.30138714 |
| Kurtosis | 2.3381995 |
| Mean | 39.876 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.0036032951 |
| Sum | 39876 |
| Variance | 144.43506 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40 | 472 | |
| 50 | 79 | 7.9% |
| 45 | 61 | 6.1% |
| 60 | 47 | 4.7% |
| 20 | 44 | 4.4% |
| 35 | 38 | 3.8% |
| 30 | 28 | 2.8% |
| 25 | 21 | 2.1% |
| 55 | 21 | 2.1% |
| 38 | 18 | 1.8% |
| Other values (46) | 171 | 17.1% |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 2 | 2 | 0.2% |
| 4 | 1 | 0.1% |
| 5 | 3 | |
| 6 | 3 | |
| 7 | 1 | 0.1% |
| 8 | 3 | |
| 9 | 1 | 0.1% |
| 10 | 7 | |
| 12 | 4 |
| Value | Count | Frequency (%) |
| 99 | 1 | 0.1% |
| 98 | 1 | 0.1% |
| 80 | 5 | 0.5% |
| 75 | 2 | 0.2% |
| 72 | 2 | 0.2% |
| 70 | 9 | 0.9% |
| 65 | 4 | 0.4% |
| 64 | 2 | 0.2% |
| 60 | 47 | |
| 59 | 1 | 0.1% |
native-country
Categorical
IMBALANCE 
| Distinct | 29 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| United-States | |
|---|---|
| Mexico | 20 |
| ? | 18 |
| Puerto-Rico | 4 |
| Cuba | 4 |
| Other values (24) | 52 |
Length
| Max length | 19 |
|---|---|
| Median length | 14 |
| Mean length | 13.304 |
| Min length | 2 |
Characters and Unicode
| Total characters | 13304 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | United-States |
| 4th row | United-States |
| 5th row | Cuba |
Common Values
| Value | Count | Frequency (%) |
| United-States | 902 | |
| Mexico | 20 | 2.0% |
| ? | 18 | 1.8% |
| Puerto-Rico | 4 | 0.4% |
| Cuba | 4 | 0.4% |
| Portugal | 4 | 0.4% |
| Philippines | 4 | 0.4% |
| Germany | 3 | 0.3% |
| India | 3 | 0.3% |
| Iran | 3 | 0.3% |
| Other values (19) | 35 | 3.5% |
Length
| Value | Count | Frequency (%) |
| united-states | 902 | |
| mexico | 20 | 2.0% |
| 18 | 1.8% | |
| puerto-rico | 4 | 0.4% |
| cuba | 4 | 0.4% |
| portugal | 4 | 0.4% |
| philippines | 4 | 0.4% |
| germany | 3 | 0.3% |
| india | 3 | 0.3% |
| iran | 3 | 0.3% |
| Other values (19) | 35 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2723 | |
| e | 1840 | |
| 1000 | 7.5% | |
| a | 971 | 7.3% |
| i | 959 | 7.2% |
| n | 938 | 7.1% |
| d | 921 | 6.9% |
| - | 910 | 6.8% |
| s | 909 | 6.8% |
| S | 907 | 6.8% |
| Other values (29) | 1226 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9484 | |
| Uppercase Letter | 1892 | 14.2% |
| Space Separator | 1000 | 7.5% |
| Dash Punctuation | 910 | 6.8% |
| Other Punctuation | 18 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2723 | |
| e | 1840 | |
| a | 971 | 10.2% |
| i | 959 | 10.1% |
| n | 938 | 9.9% |
| d | 921 | 9.7% |
| s | 909 | 9.6% |
| o | 48 | 0.5% |
| c | 32 | 0.3% |
| l | 26 | 0.3% |
| Other values (11) | 117 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 907 | |
| U | 902 | |
| M | 20 | 1.1% |
| P | 15 | 0.8% |
| C | 10 | 0.5% |
| I | 8 | 0.4% |
| R | 6 | 0.3% |
| E | 6 | 0.3% |
| G | 5 | 0.3% |
| H | 4 | 0.2% |
| Other values (5) | 9 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 1000 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 910 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11376 | |
| Common | 1928 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2723 | |
| e | 1840 | |
| a | 971 | 8.5% |
| i | 959 | 8.4% |
| n | 938 | 8.2% |
| d | 921 | 8.1% |
| s | 909 | 8.0% |
| S | 907 | 8.0% |
| U | 902 | 7.9% |
| o | 48 | 0.4% |
| Other values (26) | 258 | 2.3% |
Common
| Value | Count | Frequency (%) |
| 1000 | ||
| - | 910 | |
| ? | 18 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13304 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 2723 | |
| e | 1840 | |
| 1000 | 7.5% | |
| a | 971 | 7.3% |
| i | 959 | 7.2% |
| n | 938 | 7.1% |
| d | 921 | 6.9% |
| - | 910 | 6.8% |
| s | 909 | 6.8% |
| S | 907 | 6.8% |
| Other values (29) | 1226 |
salary
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 768 | |
| 1 | 232 | 23.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 768 | |
| 1 | 232 | 23.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 768 | |
| 1 | 232 | 23.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 768 | |
| 1 | 232 | 23.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 768 | |
| 1 | 232 | 23.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 768 | |
| 1 | 232 | 23.2% |
| age | workclass | fnlwgt | education | education-num | marital-status | occupation | relationship | race | sex | capital-gain | capital-loss | hours-per-week | native-country | salary | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 39 | State-gov | 77516 | Bachelors | 13 | Never-married | Adm-clerical | Not-in-family | White | Male | 2174 | 0 | 40 | United-States | 0 |
| 1 | 50 | Self-emp-not-inc | 83311 | Bachelors | 13 | Married-civ-spouse | Exec-managerial | Husband | White | Male | 0 | 0 | 13 | United-States | 0 |
| 2 | 38 | Private | 215646 | HS-grad | 9 | Divorced | Handlers-cleaners | Not-in-family | White | Male | 0 | 0 | 40 | United-States | 0 |
| 3 | 53 | Private | 234721 | 11th | 7 | Married-civ-spouse | Handlers-cleaners | Husband | Black | Male | 0 | 0 | 40 | United-States | 0 |
| 4 | 28 | Private | 338409 | Bachelors | 13 | Married-civ-spouse | Prof-specialty | Wife | Black | Female | 0 | 0 | 40 | Cuba | 0 |
| 5 | 37 | Private | 284582 | Masters | 14 | Married-civ-spouse | Exec-managerial | Wife | White | Female | 0 | 0 | 40 | United-States | 0 |
| 6 | 49 | Private | 160187 | 9th | 5 | Married-spouse-absent | Other-service | Not-in-family | Black | Female | 0 | 0 | 16 | Jamaica | 0 |
| 7 | 52 | Self-emp-not-inc | 209642 | HS-grad | 9 | Married-civ-spouse | Exec-managerial | Husband | White | Male | 0 | 0 | 45 | United-States | 1 |
| 8 | 31 | Private | 45781 | Masters | 14 | Never-married | Prof-specialty | Not-in-family | White | Female | 14084 | 0 | 50 | United-States | 1 |
| 9 | 42 | Private | 159449 | Bachelors | 13 | Married-civ-spouse | Exec-managerial | Husband | White | Male | 5178 | 0 | 40 | United-States | 1 |
| age | workclass | fnlwgt | education | education-num | marital-status | occupation | relationship | race | sex | capital-gain | capital-loss | hours-per-week | native-country | salary | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 990 | 46 | Private | 187370 | Bachelors | 13 | Never-married | Sales | Not-in-family | White | Male | 0 | 1504 | 40 | United-States | 0 |
| 991 | 41 | Private | 194636 | Assoc-voc | 11 | Married-civ-spouse | Machine-op-inspct | Husband | White | Male | 0 | 0 | 40 | United-States | 0 |
| 992 | 50 | Self-emp-not-inc | 124793 | HS-grad | 9 | Married-civ-spouse | Craft-repair | Husband | White | Male | 0 | 0 | 30 | United-States | 0 |
| 993 | 47 | Private | 192835 | HS-grad | 9 | Married-civ-spouse | Adm-clerical | Husband | White | Male | 0 | 0 | 50 | United-States | 1 |
| 994 | 35 | Private | 290226 | HS-grad | 9 | Never-married | Exec-managerial | Not-in-family | White | Male | 0 | 0 | 45 | United-States | 0 |
| 995 | 56 | Private | 112840 | HS-grad | 9 | Married-civ-spouse | Exec-managerial | Husband | White | Male | 0 | 0 | 55 | United-States | 1 |
| 996 | 45 | Private | 89325 | Masters | 14 | Divorced | Prof-specialty | Not-in-family | White | Male | 0 | 0 | 45 | United-States | 0 |
| 997 | 48 | Federal-gov | 33109 | Bachelors | 13 | Divorced | Exec-managerial | Unmarried | White | Male | 0 | 0 | 58 | United-States | 1 |
| 998 | 40 | Private | 82465 | Some-college | 10 | Married-civ-spouse | Machine-op-inspct | Husband | White | Male | 2580 | 0 | 40 | United-States | 0 |
| 999 | 39 | Self-emp-inc | 329980 | Bachelors | 13 | Married-civ-spouse | Exec-managerial | Husband | White | Male | 15024 | 0 | 50 | United-States | 1 |